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QQ ' It is known that mutually unbiased bases, whenever they exist, are 

T-H , optimal in an information theoretic sense for the determination of un- 
known state of a quantum ensemble. These bases may not exist in 

\^j , most dimensions and some suboptimal choices have to be made. The 

.'^ ' present paper deals with estimates of the information loss in subopti- 

f— ^ , mal choice of bases. The information is calculated directly in terms of 

T— I ' transition probabilities. I give estimates for the information content of 

^^ , measurement in some approximate MUBs proposed recently. 

O ■ 1 

:^: 1 

Dh: 

-^ I The state of a quantum system is completely specified by a ray in a complex 

j^ ■ Hilbert space TC, or more generally by a density matrix. A density matrix 

^ . is a positive operator on TC with unit trace. Thus, a density matrix has 

ri ] nonnegative eigenvalues whose sum equals 1. In particular, it is hermitian. 

, ^ ' The Hilbert space TC is in general infinite dimensional. However, if we con- 

^ ■ fine our attention to some physical quantities like spin or polarisation then 

the corresponding space is finite dimensional. The complete space of the 
system is the tensor product of this finite dimensional space and an infinite 
dimensional space which correspond to physical quantities like momentum, 
energy, angular momentum etc. As long as the interaction between these 
two types of quantities is negligible we may consider them separately since 
the complete state is product or "unentangled" state. Henceforth, we will 
consider finite dimensional spaces mostly. The finite dimensional case is of 
paramount interest in quantum computing and information. 

Often the state of the quantum system is not known and has to be 
determined by certain tests. For this, we need an ensemble. Imagine for 

^A shorter version of this paper appeared in the proceedings of EQIS 2005 
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example, a preparation apparatus designed to prepare quantum systems in 
some arbitrary dimension n (qunits!) in some specified pure state. Noise 
in the apparatus will distort the qunits and what we will get is distribution 
over pure states: a density matrix. Similarly, in the context of tomographic 
quantum cryptography jTj we have to choose a set of positive-operator valued 
measures( POVM) s for measurement. What is the optimal choice of such 
POVMs? This has been answered for projection valued measures( PVM) 
by an information theoretic analysis 0. Let us distinguish two problems 
concerned with general measurements. The first is the problem of estimation 
or hypotheses testing |3]: given a measurement procedure and the data find 
the best estimates of the state that produced the data , assuming some 
prior distribution on the state. A recent thorough analysis may be found in 
[1] . The second problem may termed as a design problem. Given a class of 
measurement to determine the parameters characterizing the state, find the 
"best measurement". Now, the best measurement will most likely depend 
upon the state, so we look for the best measurement on the average. That 
is we optimize the average information gain corresponding to the different 
measurements from the given family. The present paper is concerned with 
the second problem. 

Let 7^ be a Hilbert space of dimension n. Let V{'H) be the set of her- 
mitian operators on TC. The dimension of V{7i) as a real vector space is 
n?. Let i^{'H) C V{TC) be the set of positive operators with trace 1. It is a 
convex set. The map L : V{n) -^ V{n) such that L{T) =T- l/nTr(r) • / 
is linear and the image 1{T~L) is the space of hermitian operators with trace 
0. It has dimension n^ — 1. Here / denotes the identity operator. The affine 
space l{7i) +1 /n consists of all hermitian operators with trace 1. Therefore, 
a density matrix is completely specified by n^ — 1 parameters. 

The state of a quantum system is not directly measurable. A measure- 
ment yields only probabilities. We assume for simplicity that all hermitian 
operators are observable. Thus, let p G Vt{7i) is a state and A be hermitian 
operator. Let A = X^j Cj|ai)(ai| be the spectral decomposition of A. Here 
we use the familiar Dirac notation: {a\ is the unique dual vector (via the 
inner product) of the vector |a) in Ti. A measurement of A will record one 
of the eigenvalues q with probability Tr(|aj)(ai|/9) = {ai\p\ai) The space of 
linear operators on 7i becomes a Hilbert space of complex dimension n by 
defining the inner product 

(B,C7)=Tr(i?tc). (1) 

The corresponding norm is the Frobenius norm. Its restriction to V{7i) 
makes the latter a real inner product space of dimension -n? . Let us assume 
that A is nondegenerate. The probability of obtaining the i outcome is 

Pi = Tr{\ai){ai\p) = {\ai){ai\, p). (2) 



The probabilities may be interpreted as projections of the state p onto the 
corresponding "coordinate" vector |ai)(ai|. Of course, here a vector means 
an element in the space V{7i). It is more convenient to consider the traceless 
hermitian operators p — I jn instead of p. In any case a measurement on an 
ensemble in some basis can at best give us an estimate of the r? probabilities 
of the possible outcomes. We may thus characterize a measurement of a 
nondegenerate observable A by the corresponding orthonormal basis in the 
spectral decomposition. We shall henceforth simply refer to the bases of 
measurement. Of the r? probabilities obtained by measurement in some 
basis B only r? — \ are independent since X^jPi = 1. Let Pi = |a^(a| 
projection operators corresponding to B. They satisfy PiPj = SijPi and 
^j Pi = I. As vectors in V{H) they are linearly independent. To avoid 
confusion we call the projection operators corresponding to some basis in 
the ambient space 7i projection vectors or simply projectors when they are 
considered as elements of V{7i). But there are only n of them. Further, if 
we have two bases Bi and B2 then at most n — 1 projectors from B2 can be 
independent of those in Bi due to the relation ^^ Pj = I. Hence to get the 
n? — 1 coordinates of p we need n + 1 bases such that they are independent 
in the following sense. Pick the first n — 1 projectors each from each of 
the basis. If they are independent after the affine transformation described 
above they constitute a basis in 1(H). We call such a set of projectors a 
complete set of measurement bases( CSMB for short). Suppose we have two 
CSMBs Si and 1S2. If all other conditions remain same which one should we 
pick for determining the unknown state of a quantum ensemble? We may 
assume ideal conditions- perfect preparation procedures, perfect detectors 
and measuring devices etc.- to compare the two. In a classic paper |2] 
Fields and Wootters proved that a set of mutually unbiased bases (MUBs) 
is an optimal choice. Two sets of orthonormal bases {|aj)} and {|/3j)}are 
said to be mutually unbiased if |(ai|/3j||)^ = 1/n. They further go on to 
show that such bases exist whenever the dimension n is a prime power, 
extending earlier work of Ivanovicl^l who proved the existence of MUBs 
in prime dimension. These works however left open the question of the 
existence of MUBs for n which divides two or more distinct primes e.g. 6. It 
is now widely believed that MUBs do not exist in such dimensions. However, 
we can expect CSMBs which approximate MUBs. Then it is natural to ask: 
how much do we lose in the approximation process. This question is relevant 
even in the cases where MUBs are known to exist because in more realistic 
situations the measurement apparatus will only approximately implement 
the MUBs. However, to answer such questions we must have an appropriate 
framework in which these questions may be posed and answered precisely 
and quantitatively. The natural candidate seems to be information theory. 
In this work I elaborate on the information content of a quantum mea- 
surement process. This was partly done in [2]. I then give estimates for the 
information content of measurements in CSMBs which approximate MUBs. 



Even in the case where MUBs are known to exist there is always a margin 
of error. So even here it is reasonable to estimate the information content 
of CSMBs. 

I point out that information optimization appears in a different context in 
estimation theory, namely, hypothesis testing. Given some prior information 
about the distribution of states and the outcomes of some experiment we 
seek for optimal choice of state form the experimental data. ' 

2 Information content of a measurement 

In this section we follow [S] to define the information content of a mea- 
surement and apply it to the case of measurements for the determination 
of the quantum state of an ensemble. Let A^ be a measurement on some 
system S. We should use the term "experiment" rather than measurement 
since the latter seems to imply a single measurement. Let S be character- 
ized by some parameters denoted by 9 which will usually be drawn from 
some subset of M , the /c-dimensional Euclidean space. Let p{6) represent 
the a priori probability distribution of the parameters 6. Corresponding to 
every value of 9 there is a probability measure on X- the set of possible 
measurement data which is again a subset of some Euclidean space. We 
assume that this measure is given by p{x\0)dx. f^p{x\9)dx is the condi- 
tional probability of getting the outcome x in S C X given the state 9. Let 
p(X.) = f-^p{x.\9)p{9)d9 be the probability density of the random variable 
X. Note that, we have used the same symbol p for the probability densities 
of different random variables. This does not of course imply that they are 
the same functions. The notation is more convenient and unambiguous if 
taken in proper context. Moreover, we do nit differentiate between a random 
variable and its values. In an experiment we often are often interested in 
the posterior probability p{9\x.). That is, given the measured values x the 
probability density for which in turn gives us the probability of the state. 
This is the primary problem in estimation theory and hypothesis testing. 
The information content of the measurement A4 is defined as 

3{M,p{9),x)= f p{e\x) log p{9\x)d9- f p{e) log p{9) d9 . (3) 

li p{9\x) = then the integrand is defined to be zero and the logarithm is 
taken over an arbitrary but fixed base. The justification for this definition 
is as follows. Consider the term 



3o= I P{0) log p{9)d9. 

It is supposed to represent the prior information about the state 9. Let us 
take a simple example to illustrate an important property. Suppose it is 



known that the state 6 is found in 0' C with probabihty q. Let Ti be a 
measure of information corresponding to the knowledge whether 6 is in 0' 
or its complement. Let I2 and X3 be the amount of information gained in 
the next phase when get the value of 9 in 0' or its complement respectively. 
Then a fundamental additive property required of the information measure 
is that the total information 

I = Il + ql2 + il- q)l3 (4) 

Then it is not difficult to show that the information measure 3q is unique 
up to a constant multiple. We do not discuss these points further but refer 
the reader to any good source on basic information theory e.g. and 
for a discussion in the context of experiments. The difference between the 
posterior information 3i = J p{9\x.)\ogp{9\x) d9 and the prior information 
Jo = [{]Tr^)^ogp{9) d9 is the net information gain. It depends upon the 
experiment and the distribution of the data x. Thus we may say that one 
experiment or measurement is more informative than other. Let us calculate 
information content for some simple measurements in the quantum domain. 
Let the dimension n = 2. Suppose we have prior information that the 
state is a pure state |0) or |1) with probability 1/2. We may therefore 
model the parameter space as = {0, 1} with p(0) = p(l) = 1/2. Then 
% = 1/2 log (1/2) + 1/2 log (1/2) = -1. The logarithm is taken to the base 
2. Now suppose that we choose to make measurement Mi in the basis 
{[0), |1)} which is natural, given the prior information. Then the conditional 
probabilities may be conveniently written in the matrix form, for i,j € {0, 1} 

P(^|J) = (J J 

It is simply the unit matrix of order 2. Thus, if we get the measurement 
outcome we are sure that the state of the system was |0) etc. Then it is easy 
to see that mfi{Mi,i) = and hence J(A4i,z) = Ji{Mi,i) — Jq = 1- Now 

suppose we perversely choose the basis | — ) = ^^(lO) — |— )) for measurement 
A^2- Then the corresponding conditional probability matrix is 

^^^'^■) = G/2 1/2 

Again it is easy to see that the information gain in this case is 3{A42,i) = 0. 
That is we get no information from M2- In fact, the average information, 
to be defined below, is zero. This is of course intuitively obvious from the 
choice of basis in M.2- 

The information measure defined above depends on the state and may 
be negative. But the average information 

J{M,p{9)) = 1 3{M,p{9),x)p{x)dx (5) 



is independent of the state and is nonnegative ^. Here,the probability 
density 

j5(x) = f p{x\e)de (6) 

is the mean probabihty distributions averaged over 6. It is not difficuh to 
show that 

j{M,p{e))= I f p{e)p{x\0) log {p{x\e))dxde - f p{x)\og{p{x))dx (7) 

This is the formula we use estimate the information gained in quantum 
measurements. 



3 Quantum state tomography and MUBs 

Given an n-dimensional quantum ensemble in an unknown state how do 
we determine its state? This is the problem of quantum state tomography. 
The state is not directly observable but we may infer it from the proba- 
bility distributions observed. As mentioned in the Introduction we need 
(projective) measurement in n + 1 bases to determine the state completely 
from the observed probabilities. Actually, the state tomography problem 
has broadly two theoretical aspects. The first is a design issue. What is the 
optimal choice of bases? The second aspect is a problem of decision or esti- 
mation theory: for a given measurement what is the best possible estimate 
of the parameters characterizing the state? In this paper we will be mainly 
concerned with first aspect. So let us formulate the problem precisely now. 
Given an ensemble of quantum systems in some unknown state p. By 
an ensemble we mean an unlimited supply of identically prepared quantum 
systems. Let B^, . . . , ^"■+^ be n + 1 base in H with 

S'= = {PfEE|af)(af|}ti (8) 

As vectors in the n^-dimensional space V{Tl) at most n^ of them can be 
linearly independent. Due to the relations ^^ P^ = / for all k we can 
only have all the n vectors from exactly one basis in any independent set 
and if ;B = IJ^ B^ contains a maximal independent set then we choose the 
first n — 1 projectors Pf , i = 1 • • • n — 1 and - as a basis for V{H). Let 
s^- = (P^,Py) = Tr(P^P'). The nonnegative numbers sj^- are the respective 
transition probabilities among the vectors in the A;*^ and l^^ basis. Note 
that s^^ = 6ij since each of the basis is orthonormal. It was seen in Section 
1 that {P^ — I/n : i = 1, . . . ,n — 1 and k = 1, . . . , n + 1 form a basis for 
liTC), the space of traceless hermitian operators. Thus, for a state p let 

p-I/n = ^yH^'l-I/n)^Yl v'^ (9) 



Then, 

Tr((p - I/n)Tl) = p\ - 1/n = ^ V^T! ,T]) = ^ t^'yf (10) 

It is easy to see that if the original bases are mutuahy unbiased then 
{T^,Tn = for k ^ I, that is the operators T^ and T-, k ^l are orthogonal 
when considered as vectors. Here 

4' ^ {TtT]) = Tr{T^T]) = sfj - 1/n (11) 

and p^ is the probability of i*^ outcome in the measurement in the k^^ basis. 
If we consider the parallelepiped spanned by the vectors TJ' then i^j are the 
angles between the sides T^ and Tj. Notice also that the input parameters 
characterising the state (denoted by 6 earlier) are the components y^. We 
will denote these by a vector Y. 

By a measurement we mean a collection of several observations in differ- 
ent bases on subensembles of the original ensemble. We picture a massively 
parallel setup where we have a several measuring devices V/^ for each basis 
B . Th original ensemble is divided into large subensembles and tested by 
each of these n + 1 groups of devices. For each k < n + 1 we get frequencies 
m^ for the i outcome, l<i<n — lin the k device group. The numbers 
m^ constitute the measurement data x in |21 and [7| What is a reasonable 
probability distribution for the m^7 Here we appeal to the local limit the- 
orem [5] in probability theory which roughly states that for independent 
discrete random variables the probability distribution of their frequencies 
tends to the normal distribution in the limit A^ — > oo, A'^ the number of 
trials. We must have some prior distribution for the states. Let V be 
the volume of the parallelepiped spanned by the vectors TJ'. Assuming a 
uniform distribution for the states it can be shown that'? that the informa- 
tion gain in a quantum test is proportional to InT/ apart from an additive 
constant. We will in fact take InV as the measure for information content 
of a quantum test of an ensemble in CSMB and for a CSMB C write 2{C) 
for the information gain and V{C) for the corresponding volume. The first 
result I prove was already given in |2j but the present approach is different. 

Theorem 1 Information gainX{C) is maximum if and only if C consists of 
mutually unbiased bases. 

Proof. From the preceding discussion, we have to show that the volume 
V{C) spanned by the vectors T^ is maximal iff T^ and T- are orthogonal for 
kj^l. Notice first that (I;^ TJ^) = 6ij - 1/n. Consider the (n^ - 1) x (n^ - 1) 
matrix r(C) = (i^j) = {T^,Th and assume the ordering defined by the pair 
{k, i}. This simply means that the matrix consists of n + 1 blocks 7 , each 



a square matrix of size {n — 1) such that j (ij) = t 



u' 



no 



/7II 7I2 ... -yl^+l \ 

^21 ^22 ___ ^2n+l 



V ^n+l,n+l 



(12) 



If we choose any orthonormal basis for 1{'H) and express Tj' in this basis. 
Let T be the corresponding real matrix of the coefficients then it is clear 
that TT* = r(C), where A* is the transpose of A. It follows that detr(C) = 
{V{C))'^ and T{C) is positive definite. Thus maximizing V{C) is equivalent 
to maximising r(C). Below we will focus on the latter. From the generalised 
Fischer-Hadamard inequality [HI it follows that 

det r(C) < det 7^^ • • • det 7"+i."+i (13) 

The rhs is determinant of the product of the diagonal blocks in TC. Now, if 
the TJ^ are orthogonal then the off-diagonal blocks are all zero matrices and 
the equality holds in eq. (fT3|) . This proves the sufficiency part. 

The equality holds in ^J only if the following condition is satisfied ^ . 
Let S be the (n + 1) x (n + 1) matrix such that S{ij) = 1 if 7*-^ 7^ 
and S{ij) = otherwise. Then the equality holds if and only there is 
permutation matrix of order n + 1 such that PSP~^ is triangular. Since F 
is symmetric and P~^ = P* it follows that if PSP~^ is triangular it must 
be diagonal. The operation S — > PSP~^ permutes the diagonal elements of 
S among themselves. Hence, PSP^^ is diagonal iff all off-diagonal elements 
are zero. That is, 7*-' = for i ^ j- That is the original bases are mutually 
unbiased. The necessity is proved. 

The above theorem gives an upper bound. A natural question is: how 
tight is the bound. This is related to the estimation of the information 
content in bases which are complete but not mutually unbiased. We now 
give an estimate of the relative loss due to such a non-optimal choice. First 
let us compute the determinant in the case of MUBs. We only have diagonal 
terms. Recall that a diagonal block 7^*^ (zj) = (Pf — l/n,P^ — 1/n) = 1 — 1/n. 
We write this as 7^'^ = I — 1/nT, I is the identity matrix of order n — 1 and 
T is the matrix with all entries 1. Let Fq be the submatrix of F consisting 
of the diagonal blocks. Note that all the diagonal blocks 7*^*^ are identical. 



Lemma 1 det Fq = ^ 



n + T • 



Proof. First note that T"^ = {n — 1)T. The eigenvalues of T are therefore, 
n — 1 and 0. The rank of T is 1. Hence the eigenvalues of / — 1/nT are 1/n 
and 1. The determinant of each block is therefore 1/n and since there are 
n -|- 1 blocks the result follows. 



If C is MUB then T{C) has all off diagonals zero. Now 7"^ = (I + l/nT)-^ 
I + T. Hence in block form we have, 

/ (/ + T)7" • 
detr(C) = detro-det I : : : I (14) 



That is, the off-diagonal blocks are multiplied by the matrix I + T. Consider 
7^^^ Recall that ^^\ij) = sf,- — 1/n, i,j < ra — 1, where s^- are transition 
probabilities. An easy calculation shows that (/ + T)^^^{ij) = s^- — s^!^. 
The term s^^ appear because we omitted the n*^ basis vector from each 
basis in the state space 7i. If we had chosen another vector, say, the first 
then s^[ would have been subtracted. The point is the information content 
depends on the differences of probabilities. Only in the case of MUBs are 
these differences all zero. Next we give an estimate in the general case. 

Theorem 2 Let \sfj - sf^\ < e for some e > 0. LetT' = T{C) - /„2_i and 
let Xm be the minimum eigenvalue ofV . Then 

1+Xm ^-J- > 1 (15) 



detFo 

Proof. The theorem is a direct consequence of an estimate given in jlUj . 
From its definition T{C) is positive semidefinite because it is a real matrix 
of the form < b,,bj > for vectors bj in appropriate dimension. Hence the 
estimate in IIOJ is applicable. The upper bound is just the Hadamard-Fischer 

inequality. The lower estimate in ^0] is e 1+^™ , where p = max{\\i\ : 
Aj an eigenvalue } is the spectral radius of F'. The fact that p < max{\Ri\} , 
where \Ri\ is the sum of absolute values of the entries in ith. row of F' is 
easily proved[TT]. We get n^ — n because the diagonal blocks in F' are zero. 
Let Vd = din^- ^^ ^^ illustration let e < 1/n^ then a simple calculation 
yields det F(C)/ det Fq > e"^'" and the corresponding loss in information is 
0{l/n'^). In the cases where MUBs are known to exist, that is when n is a 
prime power it is natural to expect that in some actual designing for testing 
in these bases there would be errors. If we can bound give an estimate e for 
these errors then the information loss can be estimated. Even in cases where 
MUBs are not known to exist approximate MUBs may be constructed J12j . 
However, a direct application of the above estimates to their constructions 
does not yield very good lower bounds. If e < 1/n^, as in some cases of [T^ . 
then the information loss can be estimated to be less than a = o(l). We 
now give an exact calculation of the determinant in the second construction 
in [I2j. 



3.1 Calculation of determinants in special cases 

In the case of KRSW construction 

a = b 






n+ 1 



n^ 



n^ 



a^b,i = j 



(16) 



We calculate the determinant of the (n^ — 1) x (n^ — 1) matrix defined by 
the numbers tfj = sf-, a,b = 1, . . . , n + 1 and i,j = 1, . . . , n — 1. The 
rows(columns) of the matrix are indexed by pairs [a, i]([6, j]). We do it for 
a slightly more general case. 



It is clear that sf? 



6ij. Let r be the matrix whose entries in block 



form are the (n — 1) x [n 

Let D be the matrix which contains only diagonal blocks. Thus, 



1) matrices 706 where 



D 
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Vo .. 





22 








,,?!— l,n— 1 



\ 



.. 7 
Then one can show that det T = det D ■ det F' where 



r' = 



// 


^12 





/ 







vo 





7 



^l,n+l N^ 




n— l,n— 1 



^"6 

U 



nj' 



hJ 



l,...,n-l 



(17) 



and / is the unit matrix of order n — 1. The apparent lack of symmetry in 
r' can be removed by successively subtracting the (i + 1)*'' row from the i^^ 
in each row of blocks. The result is that typical entries are of the form 



^ab 



ab 



However, for the present purpose we stay with the first form of V . We 
calculate the determinant for a very special case. Let 



"u 



c + d 



ii a ^ b and i ^ j 



(n — l)c if a 7^ 6 and i 



J 



(18) 



->ij 



if a = b. 
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Then each of the off-diagonal blocks in F' is diagonal, 



^ 



ab 



A 



T' 



-nc 




// A 

A I 



\A A 




and 



(19) 



(20) 



First, we calculate the eigenvalues of V together with their multiplicities. 
This will give us the determinant. Let 



X 



xi 



y^n+l, 



where X is a {v? — 1) column vector and Xj are (n — 1) column vectors. This 
decomposition is made to match the decomposition of the matrix F'. Then 
if X is an eigenvector with eigenvalue d. 



n+l 



T' X = dX =^ Xi — nc 2_^ = d'^'h i = 1, ■ ■ ■ ,n + 1 

Let y = Yl^=i Xj be a n — 1 column vector. Then the above equation can 
be written as 

(1 + nc — d)xi = ncy 

Consider two possibilities. First, y = 0. Then, d = 1 + nc pro. Now, the 
subspace of C" ^^ corresponding to the solutions y = is n^ — n dimensional, 
equal to the multiplicity of the eigenvalue d = j^:^ ■ The second case is when 
y 7^ 0. Then clearly all the (n — l)-vectors are equal, i.e. , 



nc 



n(n + l)c 

-,y = -^i 



1 + nc — d I + nc — d 
Hence, in this case d = 1 — n^c. The multiplicity is clearly n — 1. Thus, 

detF' = (l + nc)"'-"(l-n2c)""' 

For example, if c = 1/n^ then the determinant is zero. This is the case in 
[KRSW05]. However, notice that the main contribution to the determinant 
comes from the eigenvalue 1 + nc but remembering that detF' < 1. This 
places restriction on c. 
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4 Discussion 

We analysed the information content of a quantum state tomography in 
PVMs. The information seems to depend upon the choice of the basis vec- 
tor we ehminate initially in constructing Tf' (see discussion following eq.(|5))). 
in this case, Bl^. However, it is easy to see that the information measures 
corresponding to different choices differ by an unimportant additive con- 
stant. Another point is that the parallelepiped whose volume was used as a 
measure for information is slightly different from the one given in [2]. But 
again the corresponding information measures differ by an additive constant. 
What is perhaps more important and difficult is the weakening of assump- 
tion of uniform prior distribution and a characterization of the optimal PVM 
as a function of the prior distribution function. Another related issue is a 
similar investigation of optimal POVMs. Two other difficult problems are:l. 
estimating information content of measurements on infinite dimensional sys- 
tems and 2. incomplete measurements. These issues will be discussed in a 
forthcoming paper. 
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